50 research outputs found
Toward portable information extraction
EThOS - Electronic Theses Online ServiceGBUnited Kingdo
GATE Teamware: A Web-based Collaborative Text Annotation Framework
This paper presents Teamware, a generic, open-source web-based collaborative annotation framework which enables users to carry out complex corpus annotation projects, involving teams of annotators working remotely. Different roles are defined (annotator, manager, administrator), allowing access to different functionalities. Documents may be pre-processed automatically, so that human annotators can begin with text that has already been pre-annotated and thus making them more efficient. The user interface is simple to learn and aimed at non-experts. It has been evaluated through the creation of several gold standard corpora and internal projects, as well as through external evaluation in commercial and EU annotation projects. It is available as on-demand service on GateCloud.net, as well as open-source for self-installation
SEMANTIC ANALYSIS FOR TOMORROW’S AUDIO-VISUAL DIGITAL ARCHIVES
natural language processing PrestoSpace 1 is a European-funded research project that aims at addressing the problem of decaying audio-visual archives throughout Europe by means of digitisation for preservation and access. One of the work areas within the project is Metadata Access and Delivery (MAD) which employs innovative methods of generating metadata for the digitised media in order to enhance the resulting archives and to ease access to the stored material. One such method is the use of automatic semantic analysis using natural language processing techniques in the process of creating analytical metadata for the preserved essence. 1
An Empirical Investigation of the Relation Between Discourse Structure and Co-Reference
We compare the potential of two classes of linear and hierarchical models of discourse to determine co-reference links and resolve anaphors. The comparison uses a corpus of thirty texts, which were manually annotated for co-reference and discourse structure